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Background 


e GLSL IR SSA work 

- Yuck! 
e Modify the IR? 

- Basic blocks, flat, SSA, ... 
e Make a new one? 


Goals 


Flat, SSA-based 

Basic block based, but... 

Preserve original control structure (loops, Ifs) 

Support high-level GLSL things, but... 

— Linking, array splitting, structure splitting, etc. 

Also support low-level things 

- DRY — don't make drivers copy optimizations unless they have to! 
Try and make it extensible, but... 

Don't try and support everyone's wacky HW! 


Control Flow 


ena pos 
e = q wea 
} | on 


if (...) { 
If kara 1 
} else { 


Critical Edges 


if (...) { 
If kara 1 
} else { 


Critical Edges 


Traditional Vec4 Model 


e Gallium (TGSI), 1965 FS/GS, Mali 400/600 
e Dying out (yay!) except for mobile 


e Everything: varying interpolation, variable 
indexing etc. in terms of 4-vectors 
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Traditional Vec4 Pipeline 
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Vec4 + SSA = ?? 


e Masked writes not possible in SSA 
- No partial updates allowed 
e Need a way to combine two SSA values 
— foo.xy = bar; 
- Becomes... 
— newFoo = vec4(bar, oldFoo. zw) ; 


e Out of SSA becomes tricky 


The Vec4 Pipeline/Modifiers in NIR 


Abs, negate, etc. easy for scalar backend 
But not for vec4 backends 


Don't want old backends to have to do more 
than necessary 


Have to fold abs, neg, etc. before out-of-SSA 
- Or keep around tons of metadata 


It's in there, but most of the time we can ignore 
it 


Variables 


e Mostly copied from variables in GLSL IR 
e Ways to use them: 

— Intrinsics (load/store/copy, etc) 

— Function arguments & return values 

- Samplers for texture instructions (before lowering) 
e Differences from LLVM 

- Entirely logical (no notion of alignment) 

- Can't take the address — no notion of pointers! 

— Supports GLSL concepts without extra metadata 


Registers 


Main goals: 
— Make it easier for older backends that don't do SSA 


— Sharing code for lowering arrays of structures or structures 
with arrays 


- Avoiding backend optimizations to do with array indexing 
Can create an array of given vector width and size 
Or a scalar array that can be accessed anywhere 
Indexing (stride, alignment, etc.) up to backends 


Intrinsics 


Almost everything that can't be constant folded 
4 arguments: 


Register/SSA value input 


Register/SSA value output 
Variables 


Constant integers (indices, sizes, etc.) 
Additional semantic bits 

- Can we delete this intrinsic? Can we reorder it? 
Defined statically by macros (nir intrinsics.c) 


Todo 


Make 1965 FS backend use SSA 
Add 1965 Vec4 backend 

fp64? 

More precise semantics for intrinsics 
Optimizations! 


Questions? 


